Module and method for searching named entity of terms from the named entity database using named entity database and mining rule merged ontology schema

ABSTRACT

A module for determining a named entity of a terminology includes: a named entity dictionary and mining rule database storing a named entity dictionary, in which a named entity of a terminology combined with an ontology schema and connected to a concept is defined, and a mining rule database, in which a mining rule configured with a mining pattern and an RDF triple combined with the ontology schema and connected to a relationship name is defined; a named entity and mining rule search unit that searches the mining rule database and the named entity dictionary for a corresponding mining rule and named entity using an inputted mining pattern and a terminology included in the mining pattern; and a named entity selection unit that, if two or more named entities corresponding to the terminology are found, selects from among them the named entity matching the concept configuring the RDF triple of the found mining rule as the named entity of the terminology.

CROSS REFERENCE TO RELATED APPLICATION

This application is a division of U.S. patent application Ser. No. 13/153,404, filed on Jun. 4, 2011, the disclosure of which is incorporated by reference in its entirety for all purposes.

BACKGROUND

1. Field

The present invention relates to a module and method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema, in which if two or more named entities corresponding to a terminology connected to the ontology schema are searched, a mining rule connected to a relationship name of the ontology schema is searched from a mining rule database, and a named entity of the terminology is selected using the mining rule.

2. Description of the Related Art

Recently, data models using ontologies have attracted attention in studies on the semantic web. An ontology is a working model of the entities and interactions existing in a specific domain.

That is, the ontology conceptualizes and specifies knowledge in the specific domain and can be defined as a network or a graph having a relationship among the concepts used in the domain.

The ontology is constructed by obtaining knowledge about its constituent elements, such as concepts, attributes of the concepts, and connection relations among the concepts, from documents related to a specific field, defining the concepts and attributes, and setting relations among the concepts.

In addition, in order to give meanings to web resources, the ontology can be expressed in a resource description framework (RDF), which uses a three-part (triple) structure of resource, attribute, and attribute value.

The RDF is constructed as a concept of resource, property, and statement. All things of an inputted document are expressed as resources, and the RDF can express properties of each resource and relation of the resource with the other resources.

An RDF network can be constructed by detecting named entities from a text and converting the named entities into corresponding RDF triples using mining rules.

Here, the named entity is a word or a set of words that can be classified, such as a name of a person or an organization, a title of music, a name of broadcasting, or a name of a place, and the mining rule is a rule configured with a mining pattern and an RDF triple that can be obtained from the mining pattern.

A variety of methods can be used to detect a named entity and to apply a mining rule, and an RDF network can be constructed by detecting a named entity and applying a mining rule using a named entity dictionary that previously stores a relation between a terminology and a named entity or a mining rule database that previously stores a relation between a mining pattern and an RDF triple.

In order to detect a named entity or apply a mining rule using the named entity dictionary or the mining rule database, it is required to update data stored in the named entity dictionary or the mining rule database and to solve ambiguity of the named entity.

SUMMARY

Therefore, the present invention has been made in view of the above problems, and it is an object of the present invention to provide a module and method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema, and a recording medium which can be read by an electronic device that has recorded the method, in which the named entity of the terminology connected to a concept (class) of the ontology schema is selected using the named entity dictionary and a mining rule database.

To accomplish the above object, according to one aspect of the present invention, there is provided a module for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema, the module including: a named entity dictionary and mining rule database for storing the named entity dictionary in which a named entity of a terminology combined with the ontology schema and connected to a concept (class) is defined, and a mining rule database in which a mining rule configured with a mining pattern and a resource description framework (RDF) triple combined with the ontology schema and connected to a relationship name (property) is defined; a named entity and mining rule search unit for searching for a corresponding mining rule and a named entity respectively from the mining rule database and the named entity dictionary using an inputted mining pattern and a terminology included in the mining pattern; and a named entity selection unit for selecting, if two or more named entities corresponding to the terminology are searched, a named entity matching to the concept configuring the RDF triple of the searched mining rule as a named entity of the terminology among the searched named entities.

The module for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema may further include a verification interface for displaying one or more of the named entity selected for the terminology, a representative terminology among terminologies, and the RDF triple of the mining rule.

The named entity dictionary may connect and store authority data comprising a named entity corresponding to the concept (class) of the ontology schema, a terminology classified as the named entity, an identifier of the terminology, a representative terminology, and an identifier of the representative terminology in one format.

The named entity selection unit may select a matching named entity by comparing the concept configuring an RDF triple of the searched mining rule with the two or more searched named entities.

According to another aspect of the present invention, there is provided a method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema, which searches for a named entity or an identifier of a terminology recognized from an inputted text document using the named entity dictionary in which a named entity of a terminology combined with the ontology schema and connected to a concept (class) is defined, and a mining rule database in which a mining rule configured with a mining pattern and a resource description framework (RDF) triple combined with the ontology schema and connected to a relationship name (property) is defined, the method including the steps of: searching for a corresponding mining rule and a named entity respectively from the mining rule database and the named entity dictionary using an inputted mining pattern and a terminology included in the mining pattern; and if two or more named entities corresponding to the terminology are searched, selecting a named entity matching to the concept configuring the RDF triple of the searched mining rule as a named entity of the terminology among the searched named entities.

The step of selecting a named entity corresponding to the terminology using the two or more searched named entities and the one searched mining rule may include the steps of: extracting the concept from the two or more searched named entities and the RDF triple of the mining rule; and selecting a named entity corresponding to the extracted concept as the named entity of the terminology.

The named entity dictionary may connect and store authority data comprising a named entity corresponding to the concept (class) of the ontology schema, a terminology classified as the named entity, an identifier of the terminology, a representative terminology, and an identifier of the representative terminology in one format.

In the step of searching for a corresponding named entity and a mining rule respectively from the named entity dictionary and the mining rule database using the terminology and mining pattern, if there is a plurality of named entities corresponding to the recognized terminology in the named entity dictionary, all the plurality of the named entities may be searched.

According to still another aspect of the present invention, there is provided a recording medium that can be read by an electronic device, for recording a program for executing the method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a view showing an embodiment of a module for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema according to an aspect of the present invention.

FIG. 2 is a view showing another embodiment of a module for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema according to an aspect of the present invention.

FIG. 3 is a flowchart illustrating an embodiment of a method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema according to another aspect of the present invention.

FIG. 4 is a flowchart illustrating another embodiment of a method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema according to another aspect of the present invention.

FIG. 5 is a flowchart illustrating a process of selecting a named entity suitable for a terminology among a plurality of named entities in a method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema according to another aspect of the present invention.

FIG. 6 is a conceptual view showing a connection relation between a named entity dictionary and a mining rule database combined with an ontology schema according to an aspect of the present invention.

DETAILED DESCRIPTION

The preferred embodiments of the present invention will be hereafter described in detail, with reference to the accompanying drawings. However, it should be understood that all the modifications, equivalents and substitutions are included within the scope of the invention defined by the claims.

Hereinafter, a module and method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema, and a recording medium which can be read by an electronic device that has recorded the method will be described in detail with reference to the accompanying drawings. Furthermore, in the drawings illustrating the embodiments of the present invention, elements having like functions will be denoted by like reference numerals and details thereon will not be repeated.

FIG. 1 is a view showing an embodiment of a module for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema according to an aspect of the present invention, and FIG. 2 is a view showing another embodiment of a module for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema according to an aspect of the present invention. FIG. 6 is a conceptual view schematically showing a connection relation between a named entity dictionary and a mining rule database combined with an ontology schema according to an aspect of the present invention.

According to a first embodiment of the present invention, as shown in FIG. 1, a module 100 for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema includes a named entity dictionary and mining rule database 110, a mining pattern generation unit 120, a named entity and mining rule search unit 130, a named entity selection unit 140, and a verification interface 150.

The named entity dictionary and mining rule database 110 may store a named entity dictionary and a mining rule database independently or as a single database.

The named entity dictionary is a database in which a named entity of a terminology combined with an ontology schema and connected to a concept (class) is defined and stored, and the mining rule database is a database in which a mining rule configured with a mining pattern and a resource description framework (RDF) triple combined with an ontology schema and connected to a relationship name (property) is defined and stored.

As shown in FIG. 6, in the named entity dictionary 610 and the mining rule database 630 associated with the ontology schema 620, a concept (class) of the ontology schema 620 is associated with a named entity of the named entity dictionary 610, and a relationship name (property) of the ontology schema 620 is associated with a relationship name set as a key value of the mining rule database.

For example, a religion, a person, a sport event type, and the like, i.e., concepts of an ontology schema, are defined as a named entity, i.e., a key value of the named entity dictionary, and a specific instance can be defined as a terminology of the named entity dictionary.

In addition, an occupation, religious faith, a field of game, and the like, i.e., relationship names of an ontology schema, are defined as a relationship name, i.e., a key value of the mining rule database, and can be respectively connected to a mining pattern and an RDF triple corresponding to each relationship name.

Accordingly, the named entity dictionary and the mining rule database can be formed in association with the concept or the relationship name of the ontology schema.

The named entity dictionary may connect and store authority data comprising a named entity corresponding to a concept of the ontology schema, a terminology classified as the named entity, an identifier of the terminology, a representative terminology, and an identifier of the representative terminology in one format. The authority data is connected through the named entity of the terminology.
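The authority data described above can be pictured as one record type. The following Python sketch is illustrative only and is not part of the disclosed embodiments; the class names, field names, and the identifiers such as "T001" are assumptions introduced for the example.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class AuthorityRecord:
    """One authority-data entry (field names are hypothetical)."""
    named_entity: str          # concept (class) of the ontology schema
    terminology: str           # terminology classified as the named entity
    terminology_id: str        # identifier of the terminology
    representative: str        # representative terminology
    representative_id: str     # identifier of the representative terminology


class NamedEntityDictionary:
    """Stores authority records and returns every record for a terminology."""

    def __init__(self):
        self._by_terminology = defaultdict(list)

    def add(self, record):
        # Assumption: lookups are case-insensitive.
        self._by_terminology[record.terminology.lower()].append(record)

    def lookup(self, terminology):
        # All matching records are returned so that ambiguity is preserved
        # for the later selection step.
        return list(self._by_terminology.get(terminology.lower(), []))


ne_dict = NamedEntityDictionary()
ne_dict.add(AuthorityRecord("thing", "apple", "T001", "apple", "T001"))
ne_dict.add(AuthorityRecord("company", "Apple", "T002", "Apple Inc.", "T003"))
ne_dict.add(AuthorityRecord("thing", "computer", "T004", "computer", "T004"))

candidates = [r.named_entity for r in ne_dict.lookup("apple")]
print(candidates)
```

Returning all records for a terminology, rather than the first one, mirrors the behavior described for the search step, in which every named entity corresponding to a terminology is searched.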

The mining rule database may store the mining rule configured with the mining pattern and the RDF triple.

A mining rule is defined to include a relationship name of the ontology schema, a mining pattern corresponding to the relationship name, and one or more RDF triples corresponding to the mining pattern, and the mining rule database may connect and store the relationship name of the ontology schema, the mining pattern, and the RDF triple in one format.
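A mining rule as defined above can be sketched as a record holding a relationship name, a mining pattern, and one or more RDF triples, with the database keyed by the relationship name. This Python sketch is an illustration under assumptions: the class and method names are hypothetical, and the pattern and triple strings simply reuse the notation of the examples in this description.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

Triple = Tuple[str, str, str]  # (subject, relationship name, object)


@dataclass(frozen=True)
class MiningRule:
    relationship_name: str            # property of the ontology schema (key)
    mining_pattern: str               # pattern associated with the key
    rdf_triples: Tuple[Triple, ...]   # one or more triples for the pattern


class MiningRuleDatabase:
    """Keeps mining rules grouped by relationship name."""

    def __init__(self) -> None:
        self._by_relationship: Dict[str, List[MiningRule]] = {}

    def add(self, rule: MiningRule) -> None:
        self._by_relationship.setdefault(rule.relationship_name, []).append(rule)

    def by_relationship(self, name: str) -> List[MiningRule]:
        return list(self._by_relationship.get(name, []))

    def by_pattern(self, pattern: str) -> List[MiningRule]:
        # Exact string comparison only; slot-aware matching is out of scope here.
        return [r for rules in self._by_relationship.values()
                for r in rules if r.mining_pattern == pattern]


rule_db = MiningRuleDatabase()
rule_db.add(MiningRule(
    "product",
    "% product of [company1] jcs [thing1] jco % self",
    (("[company1]", "product", "[thing1]"),),
))

found = rule_db.by_pattern("% product of [company1] jcs [thing1] jco % self")
print(found[0].rdf_triples)
```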

The mining pattern generation unit 120 may recognize a terminology from a text and convert the terminology into a mining pattern.

The mining pattern generation unit 120 may recognize a terminology from a text using lexical analysis, syntactic analysis, and semantic analysis in association with the named entity dictionary and express the recognized terminology as a named entity corresponding to a terminology stored in the named entity dictionary.

When the recognized terminology is expressed as a named entity corresponding to a terminology stored in the named entity dictionary, if there is a plurality of different named entities corresponding to the same terminology, the mining pattern generation unit 120 may search for and extract all the named entities.

In addition, the mining pattern generation unit 120 may convert the text document, expressed with all the named entities included, into a mining pattern.

The named entity and mining rule search unit 130 may search for a corresponding named entity and a mining rule respectively from the named entity dictionary and the mining rule database using the recognized terminology and the mining pattern.

If two or more named entities are searched by the named entity and mining rule search unit 130, the named entity selection unit 140 may select a named entity suitable for a terminology connected to a concept using the mining rule.

Since the ontology schema is associated with the named entity dictionary and with the mining rule database respectively, using constitutional elements different from each other (a concept and a named entity, and a relationship name and a relationship name), ambiguity of the named entity can be solved using the mining rule.

The verification interface 150 may display one or more of a named entity selected for the terminology, a representative terminology among terminologies, and an RDF triple of the mining rule, and a user may determine whether or not the selected named entity is a named entity suitable for the terminology.

For example, if terminologies such as ‘apple’ and ‘computer’ are recognized from a text document saying that “Apple said computers are products of the company.” and a named entity of each terminology is searched for from the named entity dictionary, ‘[thing]’ or ‘[company]’ can be searched as a named entity of terminology ‘apple’, and ‘[thing]’ can be searched as a named entity of terminology ‘computer’.

Like this, if two or more named entities are searched as a result of searching for a named entity of terminology ‘apple’, a suitable named entity needs to be determined among the two or more searched named entities.

If the text document is converted into a mining pattern, it can be converted into ‘% product of [?] jcs [thing] jco % self’, and if a mining pattern matching to the mining pattern is searched for from the mining rule database, a mining rule such as ‘% product of [company1] jcs [thing] jco % self’->‘[company1]-product-[thing1]’ can be searched.

Here, the concepts configuring the RDF triple of the mining rule are ‘[company]’ and ‘[thing]’, and if ‘[company]’ and ‘[thing]’ are matched to the searched named entity, the named entity of ‘computer’ is ‘[thing]’, and thus ‘[company]’ can be selected as a named entity of ‘apple’. Therefore, ambiguity of the named entity can be solved.
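The selection in the ‘apple’ example can be sketched in Python as follows. This is a minimal illustration, not the disclosed implementation: the function names are hypothetical, and it assumes that a concept is obtained from a triple slot by removing the brackets and any trailing index (so that ‘[company1]’ yields ‘company’).

```python
import re


def concepts_of(triple):
    """Return the concepts of the subject and object slots of an RDF triple."""
    subject, _relationship, obj = triple
    # Assumption: '[company1]' -> 'company' (strip brackets and trailing digits).
    return [re.sub(r"\d+$", "", slot.strip("[]")) for slot in (subject, obj)]


def select_named_entity(candidates, triple, resolved):
    """Select the candidate that matches a concept of the triple.

    candidates: named entities found for the ambiguous terminology
    resolved:   named entities of the other, unambiguous terminologies
    """
    remaining = concepts_of(triple)
    # Concepts already accounted for by unambiguous terminologies are used up.
    for concept in resolved:
        if concept in remaining:
            remaining.remove(concept)
    matches = [c for c in candidates if c in remaining]
    # Only a unique match is accepted; otherwise the ambiguity stays open.
    return matches[0] if len(matches) == 1 else None


triple = ("[company1]", "product", "[thing1]")
selected = select_named_entity(
    candidates=["thing", "company"],   # named entities found for 'apple'
    triple=triple,
    resolved=["thing"],                # named entity found for 'computer'
)
print(selected)
```

Because ‘computer’ already accounts for the ‘thing’ concept of the triple, only ‘company’ remains for ‘apple’, which corresponds to the reasoning of the paragraph above.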

According to a second embodiment of the present invention, as shown in FIG. 2, a module 200 for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema may include a named entity dictionary and mining rule database 210, a named entity and mining rule search unit 220, a named entity selection unit 230, and a verification interface 240.

The module 200 for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema according to the second embodiment of the present invention has a configuration that omits the mining pattern generation unit from the configuration of the first embodiment; that is, it is a module that determines a named entity of a terminology when a mining pattern is directly inputted into the module 200.

The functions of the named entity and mining rule search unit 220, the named entity selection unit 230, and the verification interface 240 of the module 200 for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema are the same as those of the named entity and mining rule search unit 130, the named entity selection unit 140, and the verification interface 150 of the first embodiment.

However, there is a difference in that the named entity and mining rule search unit 130 of the first embodiment receives a recognized terminology and a converted mining pattern from the mining pattern generation unit 120 and searches for a named entity and a mining rule using the received terminology and mining pattern, whereas the named entity and mining rule search unit 220 of the second embodiment directly receives a mining pattern from a user and searches for a named entity and a mining rule using the mining pattern and a terminology included in the mining pattern.

FIG. 3 is a flowchart illustrating an embodiment of a method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema according to another aspect of the present invention, FIG. 4 is a flowchart illustrating another embodiment of a method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema according to another aspect of the present invention, and FIG. 5 is a flowchart illustrating a process of selecting a named entity suitable for a terminology among a plurality of named entities in a method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema according to another aspect of the present invention.

According to a first embodiment of the present invention, a method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema may comprise the steps of: recognizing a terminology from a text S310; converting the terminology into a mining pattern S320; searching for a terminology and a mining rule from the named entity dictionary and the mining rule database S330; and selecting a named entity suitable for the terminology among a plurality of named entities using the mining rule.

If a plurality of named entities is searched for the same terminology as a result of searching for a terminology and a mining rule from the named entity dictionary and the mining rule database of step S330, a named entity suitable for the terminology recognized from the text document can be determined using a mining rule associated with the ontology schema.

For example, if terminologies such as ‘Yuna Kim’ and ‘Johnny Weir’ are recognized from a text document saying that “It is said that Yuna Kim said Johnny Weir is her ideal type.” and a named entity is searched for each of the terminologies from the named entity dictionary, ‘[person]’ is searched as a named entity of terminology ‘Yuna Kim’, whereas a named entity of terminology ‘Johnny Weir’ is not searched, or ‘[person]’ or ‘[thing]’ can be searched as a named entity corresponding to ‘Johnny Weir’.

Like this, if two or more named entities are searched or no named entity is searched as a result of searching for a named entity of terminology ‘Johnny Weir’, a named entity needs to be determined.

If the text document is converted into a mining pattern, it can be converted into ‘% ideal_type of [person1] jcs [?] jco % self’, and if a mining pattern matching to the mining pattern is searched for from the mining rule database, a mining rule such as ‘% ideal_type of [person1] jcs [person2] jco % self’->‘[person1]-ideal_type-[person2]’ can be searched.

Here, the concepts configuring the RDF triple of the mining rule are ‘[person]’ and ‘[person]’, and if ‘[person]’ and ‘[person]’ are matched to the searched named entity, the named entity of ‘Yuna Kim’ is ‘[person]’, and thus ‘[person]’ can be selected as a named entity of ‘Johnny Weir’. Therefore, ambiguity of the named entity can be solved.
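The matching of an inputted mining pattern containing an undetermined slot ‘[?]’ against a stored mining pattern can be sketched as below. This Python sketch rests on several assumptions that the description does not state: slots are written in square brackets, two patterns are compatible when their text outside the slots is identical, and a terminology with no dictionary entry simply adopts the concept expected by the rule. All function names are hypothetical.

```python
import re

SLOT = re.compile(r"\[([^\]]+)\]")


def concept(slot):
    """'person2' -> 'person' (assumed: a trailing index distinguishes instances)."""
    return re.sub(r"\d+$", "", slot)


def match_rule(input_pattern, rule_pattern):
    """Return {slot position: expected concept} for each '[?]' slot, or None."""
    # The text around the slots must be identical for the patterns to match.
    if SLOT.sub("[]", input_pattern) != SLOT.sub("[]", rule_pattern):
        return None
    expected = {}
    pairs = zip(SLOT.findall(input_pattern), SLOT.findall(rule_pattern))
    for position, (given, ruled) in enumerate(pairs):
        if given == "?":
            expected[position] = concept(ruled)
        elif concept(given) != concept(ruled):
            return None  # a determined slot contradicts the rule
    return expected


def choose(candidates, expected_concept):
    """Pick the named entity for an undetermined slot."""
    if not candidates:
        # Assumption: with no dictionary entry, adopt the rule's concept.
        return expected_concept
    return expected_concept if expected_concept in candidates else None


input_pattern = "% ideal_type of [person1] jcs [?] jco % self"
rule_pattern = "% ideal_type of [person1] jcs [person2] jco % self"

expected = match_rule(input_pattern, rule_pattern)
print(expected)
print(choose(["person", "thing"], expected[1]))  # 'Johnny Weir' with two candidates
print(choose([], expected[1]))                   # 'Johnny Weir' with no candidate
```

In both cases considered in the example, two candidates or none, the sketch arrives at ‘person’ for ‘Johnny Weir’, in line with the selection described above.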

According to a second embodiment of the present invention, a method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema may directly receive a mining pattern from a user and search for a named entity and a mining rule from the named entity dictionary and the mining rule database using the inputted mining pattern and a terminology included in the mining pattern S410, and select a named entity suitable for the terminology among a plurality of named entities using the searched mining rule.

In addition, one or more of a terminology, a named entity selected for the terminology, a representative terminology, and an identifier of the representative terminology are displayed on the verification interface so that a user can confirm whether or not the named entity of the terminology is suitable.

According to another aspect of the present invention, a program for executing the method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema can be recorded in a recording medium that can be read by an electronic device.

The method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema can be implemented as a computer program, and computer programmers in the art can easily contrive codes and code segments configuring the program.

In addition, a program implementing the method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema is stored in an information storage medium (a computer readable medium) that can be read by a computer, and a named entity suitable for the terminology can be determined by reading and executing the program through the computer.

According to the present invention, linguistic resources can be managed in an integrated manner using a named entity dictionary associated with a concept (class) of an ontology schema and a mining rule database associated with a relationship name (property) of the ontology schema.

Furthermore, if a plurality of named entities corresponding to a terminology is searched from the named entity dictionary using association relation of the ontology schema, the named entity dictionary, and the mining rule database, a named entity suitable for the terminology is determined, and thus ambiguity of the named entity can be solved.

While the present invention has been described with reference to the particular illustrative embodiments, it is not to be restricted by the embodiments but only by the appended claims. It is to be appreciated that those skilled in the art can change or modify the embodiments without departing from the scope and spirit of the present invention.

1. A non-transitory computer readable storage medium having a module stored therein for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema, the module comprising: a named entity dictionary and mining rule database for storing the named entity dictionary in which a named entity of a terminology combined with the ontology schema and connected to a concept (class) is defined, and a mining rule database in which a mining rule configured with a mining pattern and a resource description framework (RDF) triple combined with the ontology schema and connected to a relationship name (property) is defined; a named entity and mining rule search unit for searching for a corresponding mining rule and a named entity respectively from the mining rule database and the named entity dictionary using an inputted mining pattern and a terminology included in the mining pattern; and a named entity selection unit for selecting, if two or more named entities corresponding to the terminology are searched, a named entity matching to the concept configuring the RDF triple of the searched mining rule as a named entity of the terminology among the searched named entities.
2. The module according to claim 1, further comprising: a verification interface for displaying one or more of the named entity selected for the terminology, a representative terminology among terminologies, and the RDF triple of the mining rule.
3. The module according to claim 1, wherein the named entity dictionary connects and stores authority data comprising a named entity corresponding to the concept (class) of the ontology schema, a terminology classified as the named entity, an identifier of the terminology, a representative terminology, and an identifier of the representative terminology in one format.
4. The module according to claim 1, wherein the named entity selection unit selects a matching named entity by comparing the concept configuring an RDF triple of the searched mining rule with the two or more searched named entities.
5. A method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema, which searches for a named entity or an identifier of a terminology recognized from an inputted text document using the named entity dictionary in which a named entity of a terminology combined with the ontology schema and connected to a concept (class) is defined, and a mining rule database in which a mining rule configured with a mining pattern and a resource description framework (RDF) triple combined with the ontology schema and connected to a relationship name (property) is defined, the method comprising the steps of: searching for a corresponding mining rule and a named entity respectively from the mining rule database and the named entity dictionary using an inputted mining pattern and a terminology included in the mining pattern; and if two or more named entities corresponding to the terminology are searched, selecting a named entity matching to the concept configuring the RDF triple of the searched mining rule as a named entity of the terminology among the searched named entities.
6. The method according to claim 5, wherein the step of selecting a named entity corresponding to the terminology using the two or more searched named entities and the one searched mining rule includes the steps of: extracting the concept from the two or more searched named entities and the RDF triple of the mining rule; and selecting a named entity corresponding to the extracted concept as the named entity of the terminology.
7. The method according to claim 5, wherein the named entity dictionary connects and stores authority data comprising a named entity corresponding to the concept (class) of the ontology schema, a terminology classified as the named entity, an identifier of the terminology, a representative terminology, and an identifier of the representative terminology in one format.
8. The method according to claim 5, wherein in the step of searching for a corresponding named entity and a mining rule respectively from the named entity dictionary and the mining rule database using the terminology and the mining pattern, if there is a plurality of named entities corresponding to the recognized terminology in the named entity dictionary, all the plurality of the named entities are searched.
9. A recording medium that can be read by an electronic device according to claim 5, for recording a program for executing the method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with the ontology schema.